# Lightweight text generation

ERNIE 4.5 0.3B PT GGUF

A GGUF conversion of Baidu's ERNIE-4.5-0.3B-PT that supports Chinese and English text generation tasks.

Large Language Model Supports Multiple Languages

Jan-nano-8bit is an 8-bit quantized version converted from the Menlo/Jan-nano model, optimized for the MLX framework and suitable for text generation tasks.

Large Language Model

Huihui Ai.magistral Small 2506 Abliterated GGUF

A quantized version of Huihui AI's Magistral-Small-2506-abliterated model, published with the stated aim of making knowledge accessible to everyone.

Large Language Model

Sentientagi.dobby Mini Unhinged Plus Llama 3.1 8B GGUF

A quantized version of Dobby-Mini-Unhinged-Plus-Llama-3.1-8B, published with the stated aim of making knowledge accessible to everyone.

Large Language Model

Dleemiller.penny 1.7B GGUF

A quantized version of the Penny-1.7B large language model, published with the stated aim of making knowledge accessible to everyone.

Large Language Model

Dmindai.dmind 1 Mini GGUF

DMind-1-mini is a lightweight text generation model suitable for various natural language processing tasks.

Text Generation

Mlabonne Qwen3 0.6B Abliterated GGUF

A quantized version of the Qwen3-0.6B-abliterated model, produced with llama.cpp and suitable for text generation tasks.

Large Language Model

Qwen Qwen3 0.6B GGUF

This repository contains GGUF format model files for Qwen/Qwen3-0.6B, quantized by TensorBlock's machines and compatible with llama.cpp.

Large Language Model

Qwen3 0.6B GGUF

GGUF quantized version of Qwen3-0.6B, suitable for text generation tasks.

Large Language Model

Qwen2-96M is a miniature language model based on the Qwen2 architecture, containing 96 million parameters and supporting a context length of 8192 tokens, suitable for English text generation tasks.

Large Language Model English

Tesslate Tessa T1 3B GGUF

Tessa-T1-3B is a 3B-parameter large language model based on the Qwen2 architecture, offering multiple quantization versions to accommodate different hardware requirements.

Large Language Model English

Llama 3.1 8B RainbowLight EtherealMix GGUF

A GGUF quantized version of the Llama-3.1-8B-RainbowLight-EtherealMix model, intended for building text generation applications.

Large Language Model

Qwen2.5 1.5B Instruct GGUF

GGUF-format files for the Qwen2.5-1.5B-Instruct model, suitable for text generation tasks.

Large Language Model

Gemma 2 2b It Abliterated GGUF

Gemma-2-2b-it-abliterated is a 2.2B-parameter language model based on Google's Gemma architecture, quantized for text generation tasks.

Large Language Model English

Gemma is a lightweight open model series launched by Google, built on the technology used to create Gemini models, suitable for various text generation tasks.

Large Language Model

Gemma is a lightweight open-source large language model launched by Google, built with the same technology as Gemini, suitable for text generation tasks.

Large Language Model

Phi 3 Mini 4k Instruct Bnb 4bit

A 4-bit quantized version of Phi-3-mini-4k-instruct, produced with bitsandbytes and intended specifically for fine-tuning.

Large Language Model

Qwen1.5 Moe Tiny Random

This is a small randomly initialized model based on the Qwen1.5-MoE architecture, using float16 precision, suitable for text generation tasks.

Large Language Model

Phi 2 Super GGUF

phi-2-super-GGUF is the GGUF quantized version of the abacaj/phi-2-super model, suitable for local execution and text generation tasks.

Large Language Model

Minueza 32M Base

Minueza-32M-Base is a base model with 32 million parameters, fully trained on extensive English text corpora, suitable for text generation tasks.

Large Language Model

Transformers English

Gemma is a lightweight open-source large language model series launched by Google, built on the technology used to create Gemini models, offering a base version with 2 billion parameters.

Large Language Model

Phi2 Chinese 0.2B

A 200-million-parameter Chinese causal language model based on the Phi2 architecture, supporting text generation tasks.

Large Language Model

Transformers Supports Multiple Languages

Tinyllama V0 GGUF

TinyLLama-v0 is a lightweight language model provided in GGUF format, suitable for text generation tasks.

Large Language Model English

Puma-3B is a text generation model fine-tuned from OpenLLaMA 3B V2 on the ShareGPT Hyperfiltered dataset, suitable for various text generation tasks.

Large Language Model

Transformers English
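
Several entries above quote a parameter count (96 million, 3B, 8B) and mention quantized variants aimed at different hardware. As a rough guide only, the weight size can be estimated as parameters times bits per weight, divided by 8 to get bytes. The sketch below is an illustration, not something taken from any of the listed repositories: the function names are hypothetical, the bit widths are nominal, and the estimate ignores file metadata, per-block quantization scales, and runtime memory such as the KV cache.

```python
# Rough weight-size estimate: parameters x bits per weight / 8 bytes.
# Assumption: nominal bit widths only; real files are somewhat larger.


def estimate_weight_bytes(num_params: int, bits_per_weight: float) -> float:
    """Return the approximate size of the model weights in bytes."""
    return num_params * bits_per_weight / 8


def format_size(num_bytes: float) -> str:
    """Format a byte count using decimal units (1 GB = 10**9 bytes)."""
    if num_bytes >= 1e9:
        return f"{num_bytes / 1e9:.2f} GB"
    return f"{num_bytes / 1e6:.1f} MB"


# Parameter counts as quoted in the listing; 3B and 8B are rounded nominal figures.
models = {
    "Qwen2-96M": 96_000_000,
    "Tessa-T1-3B": 3_000_000_000,
    "Llama-3.1-8B": 8_000_000_000,
}

# Nominal precisions; actual quantization formats store extra scale data.
bit_widths = {"float16": 16, "8-bit": 8, "4-bit": 4}

for name, params in models.items():
    sizes = ", ".join(
        f"{label}: {format_size(estimate_weight_bytes(params, bits))}"
        for label, bits in bit_widths.items()
    )
    print(f"{name} -> {sizes}")
```

Treat the printed figures as lower bounds when checking whether a given variant fits in available memory; the actual file sizes published with each model are the authoritative numbers.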

© 2025 AIbase